Mining for Analogous Tuples from an Entity-Relation Graph
نویسندگان
چکیده
The ability to recognize analogies is an important factor that is closely related to human intelligence. Verbal analogies have been used for evaluating both examinees at university entrance exams as well as algorithms for measuring relational similarity. However, relational similarity measures proposed so far are confined to measuring the similarity between pairs of words. Unfortunately, such pairwise approaches ignore the rich relational structure that exists in real-world knowledge bases containing millions of entities and semantic relations. We propose a method to efficiently identify analogous entity tuples from a given entity-relation graph. First, we present an efficient approach for extracting potential analogous tuples from a given entityrelation graph. Second, to measure the structural similarity between two tuples, we propose two types of kernel functions: vertex-feature kernels, and edge-feature kernels. Moreover, we combine those kernels to construct composite kernels that simultaneously consider both vertex and edge features. Experimental results show that our proposed method accurately identifies analogous tuples and significantly outperforms a state-of-the-art pairwise relational similarity measure, extended to tuples.
منابع مشابه
Towards a Structured Representation of Generic Concepts and Relations in Large Text Corpora
Extraction of structured information from text corpora involves identifying entities and the relationship between entities expressed in unstructured text. We propose a novel iterative pattern induction method to extract relation tuples exploiting lexical and shallow syntactic pattern of a sentence. We start with a single pattern to illustrate how the method explores additional paterns and tuple...
متن کاملAn Optimal Approach to Local and Global Text Coherence Evaluation Combining Entity-based, Graph-based and Entropy-based Approaches
Text coherence evaluation becomes a vital and lovely task in Natural Language Processing subfields, such as text summarization, question answering, text generation and machine translation. Existing methods like entity-based and graph-based models are engaging with nouns and noun phrases change role in sequential sentences within short part of a text. They even have limitations in global coheren...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملAn Efficient Computation of Reachability Labeling for Social Networking Using Graph Pattern Mining: an Application of Data Mining
Graphs form a powerful modeling tool to represent complex relationships among objects in an effective manner. Graph pattern matching is one of the areas of data mining where the data is stored in the form of graphs and the set of tuples that match a user-given graph pattern are extracted. For finding the set of matching tuples faster, all the possible paths in the large directed graph, i.e., tr...
متن کاملRelevance Score of Triplets Using Knowledge Graph Embedding - The Pigweed Triple Scorer at WSDM Cup 2017
Collaborative Knowledge Bases such as Freebase [1] and Wikidata [2] mention multiple professions and nationalities for a particular entity. The goal of the WSDM Cup 2017 [3] Triplet Scoring Challenge was to calculate relevance scores between an entity and its professions/nationalities. Such scores are a fundamental ingredient when ranking results in entity search. This paper proposes a novel ap...
متن کامل